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Title: METHOD FOR DETERMINING THE IMPACT OF A 

MULTICOMPONENT NATURAL PRODUCT MIXTURE 

Cross-Reference to Related Applications 

This application is the US national phase of international application 
PCT/NL2004/000616, filed 3 September 2004, which designated the U.S. and claims 
priority of EP 03077804.7, filed 5 September 2003, the entire contents of each of 
5 which are hereby incorporated by reference. 

Field of the Invention 

The present invention is related to the field of life sciences and the domain of 
health and diseases, in particular the development of strategies and products for the 
prevention, treatment or curing of a disease. 
10 Brief Summary of the Invention 

The present invention provides a method for determining the impact of a 
multicomponent natural product mixture on the biological profile of a disease 
comprising the steps of: 

(a) determining a biological profile of the disease by comparing the biological profile 
15 of a group of living systems with symptoms of the disease with the biological 

profile of a reference (or healthy) group of living systems, using a multivariate 
analysis; 

(b) determining the impact of a series of samples of the multicomponent mixture on 
the biological profile of the disease, in which samples the concentrations of one or 

20 more natural components or groups of natural components differ, using a 

multivariate analysis; 

(c) determining the composition of the samples of the multicomponent mixture that 
have shown in step (b) a desired impact on the biological profile of the disease, 
using a multivariate analysis; 

25 (d) identifying within the compositions as determined in step (c) the effective 
components or groups of components and their respective concentrations required 
for having the desired impact on the biological profile of the disease, using a 
multivariate analysis. 

The invention also provides a method for preparing a medicament, wherein the 

30 effective natural components or groups of components as identified in step (d) are 
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combined in the respective concentrations required for having an impact on the 
biological profile of the disease. The invention further provides a method for designing 
and controlling the composition of a multicomponent mixture, wherein the 
concentration of at least one natural component of the mixture is adjusted to ensure 
5 that the at least one natural component of the mixture has an impact on a biological 
profile of the disease. The present invention also relates to the use of the present 
method as a tool for optimizing the breeding, cultivation or post harvesting processing 
of natural products for use in natural product-based medicines. 

10 Brief Description of the Drawings 

Figure 1 relates to multidimensional pharmacology and bioactivity profiling 
discovery. 

Figure 2 relates to effects of different herbal mixtures on insulin resistance. 
Figure 3 relates to multivariate data analysis of LCMS profiles in plasma. 

15 

Detailed Description of the Invention 

In contrast with the reductionistic approach as conventionally applied by 
pharmaceutical companies to develop new medicaments, the present invention is 
based on a holistic approach to living organisms. 

20 Such a holistic approach to living organisms has, for instance, been the basis of 

applying herbal medicine such as Traditional Chinese Medicine (TCM) from its 
earliest years. An important starting point in herbal medicine based approaches is 
that every healthy organism is in balance. Balance is considered to be a complex 
interplay between body and mind, which is reflected at all levels ranging from the 

25 biochemical component perspective to the energetic system control of our physical 
body. Internal imbalances can stem from a wide variety of factors and lead to a 
plethora of conditions ranging from short perturbations to chronic disease processes. 
Additionally, in such an approach as in TCM the uniqueness of each human being is 
recognized and drives the necessity to develop a personalized medication to obtain 

30 optimal results based on multi-component treatments. 

At a first glance the "Western" medical approach may seem very different from 
this. However, the revolution in genomics that has taken place in life sciences during 
the last decade has provided considerable support for a more holistic view on 



1656808 



3 

diagnosis and treatment. Furthermore, the issue of personalized medicine is now 
receiving considerable attention due to the new insights in i.a. pharmacogenomics. 
Although the principle of homeostasis has been a cornerstone of Western physiology 
for more than a century, the enormous complexity of biological systems has often 
5 driven pharmaceutical research towards trying to identify and influence only single 
targets that make the difference between health and disease. This approach has 
indeed yielded many potent drugs but also revealed major drawbacks. In fact one tries 
to influence a system by interacting with a single protein that is often part of a 
complex pathway and involved in a cascade of reactions and feed-back loops. The 

10 reality is that most diseases are multi-factorial which means that treating a single 
target provides a partial treatment (reduction of symptoms) and in the majority of 
cases no cure. Although this awareness is not new, it has been impossible to find 
alternative routes giving the mentioned complexity of the system. 

A method has, however, now been developed that enables highly detailed 

15 profiling of complex mixtures and subsequent measurements of the complex 
multicomponent induced changes in biological systems (biological effects) such as in- 
vitro (such as cell cultures) and in-vivo systems (such as animal models, humans). In 
this method the interaction of multiple components in complex mixtures such as 
natural products or extracts and mixtures thereof (including nutraceutical products, 

20 functional food products, herbal medicinal products, other natural compounds and 
biofluids) with living biological systems can very effectively be measured, using a 
particular set of steps wherein technologies are applied such as biostatistics and 
bioinformatics. By means of such measurements the impact of multicomponent 
mixtures on the biological profile of a disease can advantageously be determined. 

25 Moreover, such measurements enable the identification of the effective components 
within the multicomponent mixtures and their respective concentrations required for 
having an impact on the biological profile of the disease can be identified. 

Accordingly, the present invention relates to a method for determining the 
impact of a multicomponent natural product mixture on a biological profile of a 

30 disease within a group of living systems comprising the steps of: 

(a) determining a biological profile of the disease by comparing the biological profile 
of a group of living systems with symptoms of the disease with the biological 
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profile of a reference (or healthy) group of living systems, using a multivariate 
analysis; 

(b) determining the impact of a series of samples of the multicomponent mixture on 
the biological profile of the disease, in which samples the concentrations of one or 

5 more natural components or one or more groups of natural components differ, 

using a multivariate analysis; 

(c) determining the composition of the samples of the multicomponent mixture 
that have shown in step (b) a desired impact on the biological profile of the 
disease, using a multivariate analysis; 

10 (d) identifying within the compositions as determined in step (c) the effective natural 
components or groups of natural components and their respective concentrations 
required for having the desired impact on the biological profile of the disease, 
using a multivariate analysis. 

In a preferred embodiment of the present invention the multicomponent 
15 mixture is optimized towards the optimal activity for curing or preventing the 
development of a disease state of a living system both in view of efficacy and safety 
(cure or treat). 

Hence, the present invention also relates to a method according to the present 
invention, wherein step (d) is followed by a step (e) in which a set of multicomponent 
20 natural product mixtures is prepared on the basis of the information obtained in step 

(d) which mixtures are expected to display the desired impact on the biological profile 
of the disease, whereby step (e) is followed by a step (f) wherein the impact on the 
biological profile of the disease is determined of the set of multicomponent mixtures 
as prepared in step (e), using multivariate analysis. 

25 Accordingly, the present invention also relates to a method for determining the 

impact of a multicomponent natural product mixture on a biological profile of a 
disease comprising the steps of: 

(a) determining a biological profile of the disease by comparing the biological profile 
of a group of living systems with symptoms of the disease with the biological 

30 profile of a reference (or healthy) group of living systems, using a multivariate 

analysis; 

(b) determining the impact of a series of samples of the multicomponent mixture on 
the biological profile of the disease, in which samples the concentrations of one or 
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more natural components or groups of natural components differ, using a 
multivariate analysis; 

(c) determining the composition of the samples of the multicomponent mixture that 
have shown in step (b) a desired impact on the biological profile of the disease, 

5 using a multivariate analysis; 

(d) identifying within the compositions as determined in step (c) the effective natural 
components or groups of natural components and their respective concentrations 
required for having the desired impact on the biological profile of the disease, 
using a multivariate analysis; 

10 (e) preparing a set of multicomponent natural product mixtures on the basis of the 
information obtained in step (d) which mixtures are expected to display the 
desired impact on the biological profile of the disease; and 
(f) determining the impact on the biological profile of the disease of the set of 
multicomponent mixtures as prepared in step (e). 
15 In another preferred embodiment of the present invention, in a step (g) one or 

more of the multicomponent mixtures as prepared in step (e) are selected which 
mixtures display in step (f) a desired and improved impact on the biological profile of 
the disease. The one or more multicomponent mixtures thus selected in step (g) 
display an improved impact on the biological profile of the disease when compared 
20 with the other multicomponent mixtures prepared in step (e). 

In another attractive embodiment of the present invention, the one or more of 
the selected multicomponent mixtures as obtained in step (g) are adjusted by means 
of adding one or more further components to the mixture(s), whereafter the impact of 
the adjusted mixture on the biological profile of the disease is determined using 
25 multivariate analysis. 

Another advantage of the present invention is the fact that multicomponent 
natural product mixtures can be developed within a defined quality range. This is of 
significant importance because the quality of the respective natural components or 
groups of natural components may differ locally due to for instance different weather 
30 conditions during culturing, despite the implementation of good agriculture practice. 
If one or more of the natural components or one or more groups of natural components 
would not be of the desired quality, the multicomponent mixture can be adjusted in 
respect of said one or more natural components or groups of natural components so as 
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to have the desired impact on the biological profile of the disease. This can, for 
instance, be established by adjusting the amount of the respective one or more 
natural components or one or more groups of natural components so as to obtain the 
required concentration(s) of the one or more natural components or one or more 
5 groups of natural components, ensuring the multicomponent mixture will have the 
desired impact on the biological profile of the disease. 

Hence, in a preferred embodiment of the present invention the concentrations 
of at least one natural component or group of natural components of the mixture is 
adjusted to ensure that the at least one natural component or group of natural 
10 components of the mixture have the desired impact on the biological profile of the 
disease. 

In accordance with the present invention, the development and quality control 
of natural product-based medicines can very attractively be established. 

In the context of the present invention, a multicomponent natural product 
15 mixture is defined as a mixture of natural components or groups of natural 
components, which components are not produced by chemical synthesis but by a 
natural process. A multicomponent herbal mixture will, for instance, comprise two or 
more herbs which herbs as such may consist of a number of different components. 

The living systems include human beings and all sorts of animals. When use is 
20 made of animals, the group of living systems is suitably selected from one particular 
type of animal. 

The method according to the present invention enables the measurement of the 
effects of multiple target interventions and the development of products to optimally 
perform such interventions by a unique approach, revealing the biological profile of 

25 the effective components. The present invention provides, therefore, an important 
bridge function between the two complementary approaches used in TCM and 
Western Medicine, because it can reveal besides the effects of simple perturbations 
like a single drug also the complex perturbations using multicomponent mixtures 
such as, for instance, herbal medicine products and functional food products. This 

30 unique approach is referred to as multidimensional pharmacology (MDP) and uses the 
Systems Biology approach in which biological systems are studied by measuring and 
integrating metabolic data and other profile data, such as genetic and/or proteomic 
data. 



1656808 



7 

The determination of biological effects and in particular synergetic 
multicomponent effects in accordance with the present invention is illustrated by the 
example of herbal medicine products in intervention strategies. The determination of 
such effects is not limited to biological and synergetic effects in mammalian systems 
5 but can address all possible forms of living systems with complex mixtures derived 
from the same portfolio of life. 

The present invention is not limited to a particular type of disease, but 
embraces any disease of which a biological profile can be determined and can equally 
be applied in preventive applications. 
10 A considerable advantage of the present invention resides in the fact that it 

provides a scientific basis for both efficacy and safety of the tested multicomponent 
mixtures. 

The method of the present invention allows characterization of the complex 
multicomponent mixtures, in the case of TCM-products, the effects of these products 

15 in mammalian systems to reveal biological effects and the effects of TCM on patients. 

A further very important aspect of the method according to the present 
invention is that it allows the measurement of all biological effects including additive 
and synergetic effects. 

In health and disease studies a body fluid profile of for instance a plasma 

20 sample from a control group (reference group) and patient group (a group with 
symptoms of the disease with the biological profile) can be used to measure as many 
components as possible and is evaluated for differences in single components or 
patterns of components between the two groups to obtain a better insight in the 
underlying biological mechanisms, to detect novel biomarkers/surrogate markers, to 

25 predict toxicology or pharmacological response or to develop novel intervention routes. 
In the context of the present application a biomarker is defined as a characteristic 
that is objectively measured and evaluated as an indicator of: normal biological 
processes, pathogenic processes or pharmacological response to a therapeutic 
intervention. Biomarkers can be genes, transcripts, proteins, metabolites, (trace)- 

30 elements or any combination of those components. 

The concept of the present invention of using patterns for dynamic modeling 
directly or after non-linear or linear multidimensional compression opens the unique 
route of studying dynamical processes using systems descriptions based on component 
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patterns. In such studies perturbations as for instance via drug intervention routes 
can be monitored and evaluated in a multidimensional way and methods such as time 
series analysis, time warping and non-linear dynamic techniques can be applied. In 
the evaluation of data generated with the method of the present invention also the 
5 addition of other data and information from other sources can be of importance such 
as information from clinical dossiers of patients describing the medical diagnosis and 
clinical chemistry, or disease studies data on behavior, cognition, psychological, social, 
etc., levels. These data can be included or linked to the data set created by the method 
according to the present invention to classify patients or to discover sub-classes or 

10 other relevant observations. Especially in the design of new herbal medicine animal 
models are often preferred to yield biological activity, for instance the measure of 
insulin resistance in the study of metabolic syndrome (obesity, diabetes II, 
hypertension and other CV related diseases). The direct correlation of such data with 
the composition of the herbal mixture and the biological response profile in that model 

15 is a preferred embodiment of this invention when new mixtures are studied. When 
mixtures have already been proven to be safe in man or animal in case of veterinary 
applications, biomarker profiles of man can be used directly for the evaluation. 

The application of the present invention creates a wide variety of novel 
approaches in using metabolomics, proteomics and/or other component profiles of body 

20 fluids to enhance the overall process of biomedicine or drug discovery, development 
including clinical evaluation, diagnostic applications and post-marketing surveillance. 
The created profiles can be used first of all to obtain a better insight in the underlying 
biological process a.o. for toxicology evaluation (predictive toxicology), 
biomarker/surrogate marker or biomarker/surrogate marker pattern discovery, 

25 protein target validation, comparison of animal models and relating these to human 
studies, phenotyping typically at the metaboliteand/or protein level, responder/non- 
responder evaluation, validation of animal models such as transgenic models, 
providing the ability to correlate metabolite level data with other levels in systems 
biology such as gene, mRNA, RNAi and protein data. 

30 The method according to the present invention generates patterns of 

components and the used multivariate analysis generates patterns of relevant 
components for a given situation or investigation. In modern biology and related 
nutraceutical, pharmaceutical and biotechnological industries this is a crucial new 
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paradigm for driving the scientific research and industrial discovery and development 
processes. The basic understanding is that biology in general and disease processes in 
particular are multifactorial of nature and therefore the understanding of such 
processes requires a description or understanding based on a multitude of 
5 components. In most cases development processes are for instance based on a single 
target evaluated with a single biomarker for efficacy or for differentiating of control 
groups from patients groups. The present invention, however, provides a method that 
allows the creation of a breakthrough in the ability to describe multifactorial diseases 
by multifactorial patterns as well as multifactorial responses toward multifactorial 

10 input variables such as in combination therapies (chemotherapy) or in evaluation of 
herbal medicine, functional food and nutraceutical responses. 

In accordance with the present invention preferably at least one spectrometric 
technique, at least one electromigration-based technique or at least one 
chromatographic technique is used in step (a) to determine the profile of the disease. 

15 More preferably, use is made of two or more spectrometric techniques but special 
detection techniques for example laser-induced fluorescence and electrochemical 
detection in combination with separation techniques ((nano)-HPLC, electromigration- 
based methods) are included as well. Most preferably, use is made of at least a 
nuclear magnetic resonance technique and/or a mass spectrometry technique to 

20 determine the profile of the disease in step (a). 

The biological profile to be determined in step (a) includes preferably one or 
more metabolic, genetic and/or proteomic profiles. Any combination of these profiles 
can be used. Preferably, such combination includes one or more metabolic profiles. 
More preferably, the biological profile includes metabolic, genetic and proteomic 

25 profiles. 

In step (a) preferably the biological profile is determined of at least one type of 
body fluid or at least one type of tissue. More preferably, in step (a) the biological 
profiles are determined of at least two different types of body fluid. 

The biological profiles are determined in step (a) using one or more of the 
30 following biomarkers; genes, transcripts, proteins, metabolites and (trace) elements. 

In step (b) preferably use is made of at least one spectrometric technique, at 
least one electromigration-based technique or at least one chromatographic technique 
to determine the impact of the series of samples of the multicomponent mixture on the 
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biological profile of the disease. More preferably, use is made of two or more 
spectrometric techniques, whereas most preferably use is made of at least a nuclear 
magnetic resonance technique and/or a mass spectrometry technique. In the case of 
protein profiling other techniques such as gel-based electrophoretic techniques are 
5 included as well. 

In step (c) preferably use is made of at least one spectrometric technique, at 
least one electromigration-based technique or at least one chromatographic technique 
to determine the composition of the samples. More preferably, use is made of two or 
more spectrometric techniques, whereas most preferably use is made of at least a 

10 nuclear magnetic resonance technique and/or a mass spectrometry technique. In step 
(c) the number of samples of which the composition is determined in (c) is preferably 
at least 2, and more preferably in the range of 5-100. 

In step (d) preferably use is made of at least one spectrometric technique, at 
least one electromigration-based technique or at least one chromatographic technique 

15 to identify the effective components or groups of components and their respective 
concentrations required for having an impact on the biological profile of the disease. 
More preferably, use is made of two or more spectrometric techniques. Most 
preferably use is made of at least a nuclear magnetic resonance technique and a mass 
spectrometry technique in step (d). 

20 In step (f) preferably use is made of at least one spectrometric technique, at 

least one electromigration-based technique or at least one chromatographic technique 
to identify the effective components or groups of components and their respective 
concentrations required for having an impact on the biological profile of the disease. 
More preferably, use is made of two or more spectrometric techniques. Most 

25 preferably use is made of at least a nuclear magnetic resonance technique and a mass 
spectrometry technique in step (e). 

The multicomponent mixture to be used in accordance with the present 
invention may be any multicomponent mixture having a (potential) impact on a 
biological profile of a disease. Such multicomponent mixture may be any natural 

30 product. Suitable examples of such mixture include nutraceutical products, functional 
food products, herbal, algae, microbial, fungal medicinal products or (extracts of) 
biofluids. Preferably, the multicomponent natural product mixture is a herbal 
mixture. 
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In each of steps (a)-(d) and (f)(e) preferably use of at least one spectrometry 
technique. Suitably a nuclear magnetic resonance technique ("NMR") or mass 
spectrometry technique ("MS") can be used, whereby the latter technique focuses on a 
limited number of small molecule compounds. Both of these techniques have however 
5 limitations. The NMR approaches are limited in that they typically provide reliable 
information only of compounds present at high concentration. On the other hand, 
global or focused mass spectrometry techniques do not require high concentrations 
but can provide information at a broad screening level or of only limited portions of a 
biological profile. As used herein, the terms "small molecule" and "metabolite" are 

10 used interchangeably. Small molecules and metabolites include, but are not limited 
to, lipids, steroids, amino acids, organic acids, bile acids, eicosanoids, peptides, 
carbohydrates and trace elements. 

Therefore, in each of steps (a)-(d) and (f)(e) preferably use is made of at least a 
nuclear magnetic resonance technique and/or a mass spectrometry technique with a 

15 preference for mass spectrometry in the field of protein profiling (proteomics). 

The situation for small molecules is discussed as an example. Sample 
preparation for NMR can generally be very straightforward using freeze-drying and 
reconstitution in D2O because the focus is on the higher concentration components, 
i.e. of typical concentration > 100 nanogram/mL. For MS a variety of sample 

20 preparation approaches can be used ranging from solid phase extraction and 
liquid/liquid extraction to more specific methods using, for instance, affinity-based 
methods or derivitization procedures both for GC-MS as well as LC-MS. 

In each of steps (a)-(d) and (f)e) use can be made of spectrometric data obtained 
from one or more platforms including, but not limited to, MS, NMR, liquid 

25 chromatography ("LC"), gas-chromatography ("GC"), high performance liquid 
chromatography ("HPLC"), capillary electrophoresis ("CE"), and any known form of 
hyphenated mass spectrometry in low or high resolution mode, such as LC-MS, GC- 
MS, CE-MS, LC-UV, MS-MS, MS n , etc. Typical profile data can also be obtained by 
using more component specific detectors such as laser-induced fluorescence and 

30 electrochemical detection. 

As used herein, the term "spectrometric data" includes data from any 
spectrometric or chromatographic technique. Spectrometric techniques include, but 
are not limited to, resonance spectroscopy, mass spectroscopy, and optical 
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spectroscopy. Chromatographic techniques include, but are not limited to, liquid 
phase chromatography, gas phase chromatography, and electrophoresis. 

If a spectral profile is obtained, as with standard spectroscopic methods, a 
primary necessary step is to adjust for minor shifts in the spectra both in the intensity 
5 dimension as well as in the spectral or chromatographic dimension. Shifts can be due 
to instrumental factors, environmental conditions, or to varying concentrations of 
components (as is often the case in urine analysis). As an example, variation in NMR 
chemical shifts often occurs and needs to be accounted for, but the reproducibility and 
standardization in the intensity (or peak area) of a single profile (quantitation 

10 dimension) is typically very satisfactory. This is in contrast to MS where the peak 
intensity (ion abundance) dimension needs to be carefully adjusted or standardized 
due to lack of calibrants for each component present in the profile. In hyphenated 
techniques the reproducibility of the separation method (GC, LC or electromigration 
driven techniques such as capillary electrophoresis (CE)) needs to be carefully 

15 evaluated as well. In this respect near-infrared spectral profiles are impressive and 
correction in either dimension is hardly required. 

In general, small instrumental shifts in the spectral (variable) dimension will be 
falsely interpreted as representing different components when a collection of data 
profiles is subjected to pattern recognition analysis. A straightforward way to cope 

20 with this problem is by using binning techniques in which the spectrum is reduced in 
resolution to a sufficient degree to insure that a given peak remains its bin despite 
small spectral shifts between analyses. For example, in NMR the chemical shift axis 
may be descretized and coarsely binned, and in MS the spectral accuracies may be 
rounded to integer atomic mass unit values. However, more subtle procedures are 

25 preferred such as partial linear fit for NMR or other alignment procedures for MS. 

After initial data pre-processing the spectral profiles are set for pattern 
recognition (multivariate analysis). 

The ability to use different techniques to produce body fluid profiles is 
optimally used by multiway multivariate analysis and allows the measurement of 

30 different body fluids of the same system (such as plasma and urine or plasma and 
CSF) to reveal novel insights into the systems biology, for instance the effect of the 
blood brain barrier when comparing plasma and urine profiles. Hence, in each of steps 
(a)-(d) preferably use is made of multiway multivariate analysis. 
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The present invention provides a method of spectrometric data processing 
utilizing multiple steps of multivariate analysis to process data in a hierarchal 
procedure (steps (a)-(d) and (f)e)). In each of steps (a)-(d) and (f)e) a first multivariate 
analysis can be used on a plurality of data sets to discern one or more sets of 
5 differences and/or similarities between them, whereafter a second multivariate 
analysis can be used to determine a correlation (and/or anti-correlation, i.e., negative 
correlation) between at least one of these sets of differences (or similarities) and one 
or more of the plurality of data sets. In step (a) the determination of the biological 
profile of a disease may also be based on the correlation. 

10 As used herein, the term "data sets" refers to the spectrometric data associated 

with one or more spectrometric measurements. For example, where the spectrometric 
technique is NMR, a data set may comprise one or more NMR spectra. Where the 
spectrometric technique is UV spectroscopy, a data set may comprise one or more UV 
emission or absorption spectra. Similarly, where the spectrometric technique is MS, a 

15 data set may comprise one or more mass spectra. Where the spectrometric technique 
is a chromatographic-MS technique (e.g., LC-MS, GC-MS, etc), a data set may 
comprise one or more mass chromatograms. Alternatively, a data set of a 
chromatograms or reconstructed TIC chromatograms. In addition, it should be 
realized that the term n data sets" includes both raw spectrometric data and data that 

20 has been pre-processed (e.g., to remove noise, baseline, detect peaks, etc.). 

Moreover, as used herein, the term "data sets" may refer to substantially all or 
a sub-set of the spectrometric data associated with one or more spectrometric 
measurements. For example, the data associated with the spectrometric 
measurements of different sample sources (samples of groups with symptoms of the 

25 disease (experimental group samples) v. samples of reference or healthy groups 
(control group samples)) may be grouped into different data sets. As a result, a first 
data set may refer to experimental group sample measurements and a second data set 
may refer to control group sample measurements. In addition, data sets may refer to 
data grouped based on any other classification considered relevant. 

30 The present invention also provides a method of spectrometric data processing 

utilizing multivariate analysis to process data at two or more hierarchal levels of 
correlation. In each of steps (a)-(d) and (f)e) use can be made of a multivariate 
analysis on a plurality of data sets to discern correlations (and/or anti-correlations) 
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between data sets at a first level of correlation, whereafter the multivariate analysis 
can be used to discern correlations (and/or anti-correlations) between data sets at a 
second level of correlation. In step (a) the determination of the biological profile of a 
biological system may also be based on the correlations discerned at one or more 
5 levels of correlation. 

In accordance with the present invention the spectrometric data processing in 
each of steps (a)-(d) and (f)e) can be carried out utilizing multiple steps of multivariate 
analysis to process data sets in a hierarchal procedure, whereby one or more of the 
multivariate analysis steps further comprises processing data at two or more 

10 hierarchal levels of correlation. For example, in each of steps (a)-(d) a first 
multivariate analysis can be used on a plurality of data sets to discern one or more 
sets of differences and/or similarities between them; a second multivariate analysis 
can be used to determine a first level of correlation (and/or anti-correlation) between a 
first set of differences (or similarities) and one or more of the data sets; and the second 

15 multivariate analysis can be used to determine a second level of correlation (and/or 
anti-correlation) between the first set of differences (or similarities) and one or more 
of the data sets. In step (a) the determination of the biological profile of the disease 
can be based on the correlations discerned at one or more levels of correlation. 

Suitable forms of multivariate analysis include, for example, principal 

20 component analysis ("PCA"), discriminant analysis ("DA"), PCA-DA, factor analysis, 
canonical correlation ("CC"), partial least squares ("PLS"), predictive linear 
discriminant analysis ("PLDA"), neural networks, multilevel/multiway/multiblock 
analysis, iterative target analysis, general procrustus analysis, support vector 
machines ("SVM"), parafac and pattern recognition techniques. 

25 The use of the above-described multivariate analysis techniques is well-known 

in the art. For a more detailed description reference can, for instance, be made to 
pending US Provisional Patent Application , Serial No. 60/312,145 (Method and 
System for Profiling Biological Systems) of which the entire contents are hereby 
incorporated by reference. 

30 To extract the maximum value from the data, multivariate analysis tools as 

outlined above may be used in concert with additional statistical and informatics 
strategies. Once statistically significant differences in, for instance, metabolite 
abundances are determined and quantified among groups of samples, the objective 
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becomes to understand the underlying biological reasons for and the contexts of the 
results. A first step is to identify metabolic components observed in data spectra and 
revealed by multivariate analysis to be significant differences among samples. Such 
identification typically involves querying various databases of known metabolite 
5 component spectra and structures. A next step is to mine existing knowledge about 
molecular interactions through searches of public and private databases. This can go 
some way in explaining the associations and behaviour observed in the metabolomic 
profile results. However, because most of the metabolic, genomic, proteomic, and 
interaction databases depict biochemical events in a static state, progressively 

10 sophisticated analytical and mathematical tools are needed to integrate disjointed 
biological clues into dynamic models that are better suited to explain, for example, 
pathological processes. 

Indeed, both linear and non-linear multivariate analyses may uncover 
statistically significant associations between biomolecular components that will not be 

15 explained through the mining of existing databases or literature. 

A preferred embodiment of the method of the present invention comprises the 
following steps : 

1. A selection is made of the relevant samples, for instance body fluids (plasma, 
urine, CSF, saliva, synovial fluid etc.) 
20 2. A selection is made of the width of the biological profiling; transcripts, proteins, 
metabolites, etc. 

3. A sample is prepared based on the spectrometric techniques to be used for 
determining the biological profile (e.g. GCMS, LCMS, CEMS, MS/MS combinations 
various NMR-methodologies, gel-based electrophoretic techniques etc.) 

25 4. The profile is determined using the spectrometric techniques, gel-based techniques, 
NMR-profiles and preferred MS-approaches in case of metabolomics covering lipids, 
steroids, bile acids, eicosanoids, (neuro)peptides, vitamins, organic acids, 
neurotransmitters, amino acids, carbohydrates, ionic organics, nucleosides, 
inorganics, xenobiotics, etc., with a preference to include peptides. Also a global MS- 

30 profile to describe in a single experiment the major high concentration components 
can be included which is often a good indication for the balance/homeostasis of a 
system in addition to the NMR-profile. 
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5. The data obtained is preprocessed using preferable the technique described in 
Dutch patent application 1016034, in combination with PCA-DA, multiblock/multiway 
multivariate analysis based on linear and non-linear techniques and the partial linear 
fit algorithm. 

5 6. The outcome of 4 is combined with other relevant data sources such as medical 
history, clinical chemistry records, medical description and behavioural, social, 
psychological data, etc. 

7. The (non-linear or linear) dynamics of dynamical diseases are studied by using one 
of the profiling-components or any combination of the profiling components, 
10 preferably by using a combination of non-linear compression and dynamic modeling 
techniques. 

The concept of the present invention is suitably based on the following aspects, 
the profiling of complex mixtures such as body fluids by a combination of NMR and a 
selection of hyphenated mass spectrometric techniques (GC-,LC-,-CE-MS/MS, 

15 ICPMS), the evaluation of the combination with data preprocessing/scaling preceding 
multiblock/multiway multivariate analysis, the combination of instrumental datasets 
created with other relevant datasets, the linking of data sets arising from samples 
from one system but via different body fluid profiles and the ability to study all forms 
of non-linear dynamics. 

20 In cases where biological active components or groups of active components are 

detected and the composition constraints for a desired effect are revealed, the 
knowledge can be used to design mixtures or novel herbal medicine. 

Hence, the present invention also relates to a method for preparing a natural 
product-based medicament, wherein the effective natural components or groups of 

25 components as identified in step (d) are combined in the respective concentrations 
required for having a desired impact on the biological profile of the disease. 

The present invention also relates to the use of a multicomponent mixture as 
prepared in step (e) for preparing a natural product-based medicament. The present 
invention also relates to the use of a multicomponent mixture as selected in step (g) 

30 from the set of mixtures prepared in step (e) for preparing a natural product-based 
medicament. 

The present invention also relates to the use of a multicomponent mixture as 
prepared in step (e) in a food application. The present invention also relates to the use 
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of a multicomponent mixture as selected in step (g) in a food application. Suitable food 
applications include nutraceuticals and functional foods. The invention can also 
suitably be used to determine the safety of food products. 

The invention further relates to a medicament comprising a multicomponent 
5 mixture as prepared in step (e). 

The detection of biological active patterns in mixtures such as herbal products 
enables also the focused quality control or optimized production of plants enabling 
efficient and well-controlled products. This is a crucial step and a major bottleneck in 
the current status of quality control of natural products. Although spectroscopic 

10 techniques have been suggested for controlling the composition of a mixture, see for 
instance WO0047922 (Dunn et a.), the inability to link a concentration change of a 
component or a chance in composition ratio between multiple components in a product 
with biological activity prohibits the mandatory need for concentration-effect control. 
The present invention constitutes therefore a breakthrough as both the profiling of 

15 the multicomponent product and the profiling of the biological response can be 
performed, and both profiles so obtained can be correlated. Therefore, the present 
invention also relates to a method for designing a multicomponent mixture and 
controlling the composition of a multicomponent mixture, wherein the concentration 
of at least one natural component or at least one group of natural components of the 

20 mixture is adjusted to ensure that the at least one natural component or group of 
natural components of the mixture has an impact on a biological profile of the disease. 
The multicomponent mixture can for instance be contained in a plant which quality is 
controlled and adjusted to produce a high quality product. 

The detection of biological active patterns in mixtures such as herbal products 

25 enables also the focused quality control or optimized production of plants enabling 
efficient and well-controlled products. 

In cases where all or part of the biological active components is detected and 
the composition constraints for optimal effect are revealed, the knowledge can be used 
to design mixtures of synthetically obtained components. In the latter case 

30 pharmaceutical preparations can be generated. 

The present invention also relates to the use of the present method for setting 
up breeding programs, Good Agriculture/Manufacture Practice (GAP/GMP) protocols 
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and post-harvesting processing of natural products for use in natural product -based 
medicines. 

5 Example 

The method according to the present invention is schematically depicted in Figure 1. 
A typical experiment for herbal medicine in accordance with the present invention 
based on the fingerprints as generated by the systems biology approach a typical 
experiment is based on the following steps: 
10 1. A number of batches of a mixture or a set of different herbal mixtures, 

preferably with a significant variation in composition, is measured (fingerprinted) by 
a profiling technique such as NMR or mass spectrometry; indicated as batches 1-n in 
the left part of Figure 1. 

2. The profiles of the effect of those batches after administration into an 
15 animal model or in a human trial are measured as well as a reference group which is 

not treated (comprised of both disease animals and wildtype or healthy 
subjects/patients) or chosen to be treated in another way. 

3. The reference group provides the biomarker profile for the disease and the 
other experiments provide the impact of the mixture on the disease pattern and 

20 reveal also other effects. In addition clinical endpoints in human trials or typical 
biological effects measured in cell-models and/or animal models can be used for 
evaluation of the multicomponent mixture. This yields a prove-of-effect on the specific 
disease group. In case a control group is not available comparative analysis can be 
used to reveal the optimal biological effect towards a clinical endpoint or hypothesis. 

25 4. (Non)-linear multivariate correlation of the patterns of the mixture 

components and the effect profiles including also all other information such as clinical 
endpoints or any biological effect in other models (cell based or animal models) 
enables the detection of the patterns of components responsible for the biological 
effects. 

30 5. In designing herbal medicine, the composition of the product can be varied 

and evaluation based on a biological response in an animal model or disease 
biomarker fingerprint can be used to optimize for efficacy and safety. In Figure 2 an 
example is given of a readout of an animal experiment, measuring the effect of herbal 



1656808 



19 

mixtures on insulin resistance and liver safety. Different compositions yield different 
efficacy (Insuline resistance) and also display different safety profiles as measured via 
a liver enzyme (ALAT) in the model. In addition the biomarker fingerprint of each 
mixture can be evaluated as shown in Figure 3, in which the multivariate analysis of 
5 one of the mixtures is given compared to the control (disease) group as a function of 
time. Clearly the effect shows after 4 weeks of treatment and further analysis of the 
data yields component specific and biological pathway or system communication 
specific information. Correlations between the outcome of the experiments related to 
Figures 2 and 3 provide the information which component(s) or groups of components 
10 are responsible for the biological effect. Optimization can be performed with or 
without identification of the components of the mixture, 

6. After designing the optimal multicomponent mixture the fingerprint of the 
mixture can be controlled based on the information as obtained under 4. Profiling of 
multicomponent mixtures such as herbal medicine can be performed with separation 

15 methods but more preferable with spectroscopic techniques such as NMR or mass 
spectrometry techniques using direct infusion with or GC/MS and LC/MS 
methodologies. Direct fingerprinting strategies based Fourier Transform Mass 
Spectrometry are especially attractive given the high resolution capabilities and 
unique identification power, which is for the complexity of herbal medicine products 

20 often preferred. 

7. In case of growing and producing of plants the described invention can be 
used for optimal selection of the conditions of production/culturing of such plants and 
the quality control of the production. 
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